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Abstract 

Background: Histone post-translational modifications are critical determinants of chromatin structure and function, 
impacting multiple biological processes including DNA transcription, replication, and repair. The post-translational 
acetylation of histone H4 at lysine 16 (H4K16ac) was initially identified in association with dosage compensation of 
the Drosophila male X chromosome. However, in mammalian cells, H4K16ac is not associated with dosage 
compensation and the genomic distribution of H4K16ac is not precisely known. Therefore, we have mapped the 
genome-wide H4K16ac distribution in human cells. 

Results: We performed H4K16ac chromatin immunoprecipitation from human embryonic kidney 293 (HEK293) cells 
followed by hybridization to whole-genome tiling arrays and identified 25,893 DNA regions (false discovery rate 
<0.005) with average length of 692 nucleotides. Interestingly, although a majority of H4K16ac sites localized within 
genes, only a relatively small fraction (-10%) was found near promoters, in contrast to the distribution of the 
acetyltransferase, MOF, responsible for acetylation at K16 of H4. Using differential gene expression profiling data, 73 
genes (> ±1. 5-fold) were identified as potential H4K16ac-regulated genes. Seventeen transcription factor-binding 
sites were significantly associated with H4K16ac occupancy (p< 0.0005). In addition, a consensus 12-nucleotide 
guanine-rich sequence motif was identified in more than 55% of the H4K16ac peaks. 

Conclusions: The results suggest that H4K16 acetylation has a limited effect on transcription regulation in HEK293 
cells, whereas H4K16ac has been demonstrated to have critical roles in regulating transcription in mouse embryonic 
stem cells. Thus, H4K16ac-dependent transcription regulation is likely a cell type specific process. 
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Background 

Covalent modifications of histone tails are thought to 
alter chromatin structure, thereby controlling DNA 
transcription, repair and replication. Acetylation of lysine 
16 of histone H4 (H4K16ac) has the potential to create 
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or secure binding platforms for transcriptional factors as 
well as other chro matin-modifying enzymes [1,2]. Histone 
H4 lysine 16 is acetylated by males absent on the first 
(MOF, also called MYST1 or KAT8), a highly conserved 
member of the MYST histone acetyltransferase (HAT) 
family. MOF itself was originally discovered in Drosophila 
as an essential component of the X chromosome dosage 
compensation complex (DCC), also known as the male- 
specific lethal (MSL) complex. MOF increases expression 
of X-linked genes in male flies by 2-fold [3-6] and disrup- 
tion of the MOF chromobarrel domain leads to genome- 
wide H4K16ac loss and compromised MSL targeting to 
X-linked genes [7]. In addition to its well defined role in 
dosage compensation, MOF has recently been found at 
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active promoters genome-wide in both male and female 
flies, where it is bound as part of the nonspecific lethal 
(NSL) complex [8-10]. Other members of the MYST 
acetyltransferase family including acute myeloid leukemia 
(MOZ), transcriptional silencing in Saccharomyces cerevisiae 
(SAS2 and YBF2/SAS3), interactions with human 
immunodeficiency virus Tat in humans (TIP60), are also 
known to have transcriptional functions as well as roles in 
DNA damage repair [11-19]. 

In mammals, MOF is essential for development. 
Constitutive ablation of Mof leads to peri-implantation 
embryonic lethality in mice [20,21]. MOF is also essential 
for post-mitotic cell survival as Cre-mediated conditional 
MOF deletion in Purkinje cells induces chromatin 
blebbings and cell death, suggesting Mof has a crucial role 
in maintenance of chromatin structures in vivo [22]. His- 
tone acetylation has been suggested to have a role in both 
transcriptional initiation and elongation as nucleosomes 
present formidable barriers to the passage of Pol II during 
transcriptional elongation [23] and global acetylation in 
transcribed regions is required for increased basal levels of 
transcription in yeast [24-26]. Acetylation of nucleosomal 
histones in the region of transcription start sites (TSSs) 
may stabilize the binding of chromatin remodeling factors 
to promoter regions [27] and/or destabilize nucleosome 
structure [28,29], leading to decreased nucleosome occu- 
pancy at TSSs that facilitates RNA Pol II binding [30]. 
H4K16ac has been shown to impact higher order chro- 
matin structure and create an open, highly accessible 
environment [31,32] changing functional interactions 
between chromatin-associated proteins [32], and serving 
as a switch for altering chromatin from a repressive to a 
transcriptionally active state in yeast and humans [33] . 

Tip60 (Tat-interacting protein) acetylates histones 
H2A, H3 and H4 and plays a role in DNA repair [34]. In 
Drosophila, Tip60 acetylates nucleosomal phospho-H2Av 
and replaces it with an unmodified H2Av [16]. The 
acetylation of histone H4 by Esal (essential SAS2-related 
acetyltransferasel) is required for DNA repair in yeast [11] 
and thus a similar modification may function in mamma- 
lian cells. Depletion of human MOF (hMOF) in cells results 
in a corresponding H4K16ac loss [13,19,35-37] indicating 
that hMOF protein is the primary HAT responsible for 
histone H4 acetylation at K16. Recent studies have 
demonstrated that depletion of hMOF correlates with 
decreased DNA damage-induced activation of ATM and 
prevents ATM from phosphorylating downstream effectors, 
such as H2AX, p53 and CHK2 [13,37]. Furthermore, 
hMOF physically interacts with ATM [13,35] and hMOF 
binds to and acts synergistically with p53 to increase 
H4K16ac and target gene transcription in vitro [35]. In 
yeast, the presence of H4K16ac and H2A.Z synergistically 
prevent the ectopic propagation of heterochromatin in 
subtelomeric regions [33]. 



Consistent with these genetic and biochemical results, 
genome-wide location analysis of HAT binding in yeast 
demonstrated a good correlation with transcriptional 
activation [38]. Numerous combinatorial patterns of 
histone modifications, including those related to HATs, 
in human CD4 + T cells have been studied on a genome- 
wide scale [39,40] showing that modification patterns 
change during cell differentiation [41-43]. Furthermore, 
recent genome-wide binding analyses demonstrated that 
mouse Mof plays a critical role in the Nanog-dependent 
transcription network in mouse embryonic stem (ES) 
cells [44]. In the present studies we determine the 
genome- wide distribution of H4K16ac, the product of 
MOF acetylation, and compared it with MOF-dependent 
differential expression of genes in HEK293 human cancer 
cells. Surprisingly, the potential contribution to gene 
expression of H4K16ac in HEK293 cells is limited, as 
opposed to results from ES cell studies, suggesting that 
H4K16ac-dependent transcription regulation is cell 
type-dependent and that MOF may have other important 
roles in addition to transcriptional regulation through 
H4K14ac modification. 

Results and discussion 

The histone code modification H4K16ac has been 
identified in various organisms as a probable marker of 
actively transcribed genes within euchromatin [45,46]. 
Acetylation of H4K16 is carried out by the MOF 
acetyltransferase and while the genomic distribution of 
MOF bound sites has been determined, a comprehensive 
mapping and analysis of H4K16ac sites has not been 
performed in human cells. To precisely identify all 
H4K16ac genomic locations, we performed chromatin 
immunoprecipitation (ChIP) from HEK293 cells with 
an H4K16ac specific antibody followed by hybridization of 
the co-precipitated DNA to NimbleGen whole-genome 
tiling arrays. Analysis of the tiling data identified 25,893 
high confidence DNA associated regions [false discovery 
rate (FDR) <0.005] with an average length of 692 bases 
that were distributed unevenly throughout the genome 
(Figure 1). In general, there was a tendency for the regions 
near chromosome ends to have areas with high H4K16ac 
levels (41/46) while centromeric regions are largely free of 
H4K14ac (43/46). At a finer level, the majority (82%) of 
H4K16ac modification sites were located within genes: 
45% within introns, 10% of sites 10 kb distal to annotated 
TSS, 10% near transcription end sites (TES) and a rela- 
tively small fraction of binding sites within exonic (3%) 
and coding (2%) regions (Figure 2A). Gene-poor regions, 
designated as intergenic regions, contain only 18% of 
H4K16ac sites. These results are generally consistent 
with the previously described analysis of Mof-associated 
DNA regions [44] in mouse embryonic stem cells (ESCs), 
where about 70% of sites were within genes, although a 
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Figure 1 Chromosomal distribution of H4K16ac sites. HEK293 cell genomic DNA was isolated and subjected to ChlP-ChIP analysis using an 
antibody against H4K16ac and the NimbleGen Human Whole-Genome Tiling Array Set (see Materials and methods). Results are summarized for 
each chromosome with the bar-height representing the regional log 2 H4K14ac intensity > 2-fold. 
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Figure 2 Localization of H4K16ac peaks in the HEK293 genome. (A) Genome-wide distribution of H4K16ac peaks. H4K14ac peak distribution 
was categorized as within exon, intron, coding and other genomic locations as described. Each bar represents the percentage of peaks in each 
category. (B) Correlation of H4K16ac peaks and gene density in human chromosome 1. The bar graph for H4K16ac density and the gene density 
map of human chromosome 1 are aligned. Although these two diagrams are well correlated (I and II), some regions show differences (III and IV). 
(C) Gene ontology for the genes enriched by H4K16ac ChlP. A total of 3,538 genes identified as H4K16ac positive genes are subjected to gene 
ontology analysis. 
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higher percentage of sites (-30%) were also localized in 
coding regions. A general correlation of H4K16ac distribu- 
tion with gene density in human chromosome 1 was ob- 
served (regions I and II), although there were areas where 
the H4K16ac distribution did not correlate with gene dens- 
ity (regions III and IV) (Figure 2B). 

To determine the role of H4K16ac histone modification 
in regulating gene expression, we analyzed the spectrum of 
H4K16ac sites genome-wide for locations within or near 
genes. We found 5002 of the 25,893 H4K16ac sites local- 
ized in 3538 unique genes within a region encompassing 
10 kb upstream and downstream of annotated TSS. 
Gene ontology and KEGG pathways analysis using the 
GeneCodis2 web-based integrative knowledgebase [47] 
for the 3538 genes revealed the highest enrichment for neu- 
roactive ligand-receptor interactions, MAPK signaling, focal 
adhesion, Wnt signaling, apoptosis and cancer pathways 
(Figure 2C). 

Examples of H4K16ac peak localization in promoter/ 
enhancer regions are shown in Figure 3A. The location 
in the chromosome, H4K16ac peaks (red) and gene name 
and exon/intron structure with transcription direction are 
shown. As a potential regulatory mechanism of gene 



expression, we next asked whether these H4K16ac peaks 
in or near genes overlap with transcription factor binding 
sites. Using position weight matrices representing DNA 
target sites for 184 transcription factors reported in the 
TRANSFAC database, we determined the number of 
these transcription factor-binding sites that were over- 
represented near H4K16 acetylation sites (relative to ran- 
domly selected control DNA fragments of the same size, 
see Materials and methods). We found 17 transcription 
factor-binding sites that were significantly associated with 
H4K16 acetylated regions compared to control (p < 0.0005, 
Figure 3B). Interestingly, all 17 transcription factor-binding 
sites were localized near promoters of genes implicated in 
cell cycle progression, proliferation and apoptosis. These 
observations support the previous studies describing the 
role of H4K16 acetylation in cell cycle regulation and cell 
growth [44,48]. The H4K16ac associated DNAs were also 
analyzed to determine whether a unique or novel DNA 
motif was over-represented in the bound DNAs. Analysis 
of all 25,893 binding peaks with the Gibbs motif sampler 
[49] identified a 12-mer G-rich sequence motif (Figure 3C). 
This motif is present at least once in more than 55% of all 
H4K16ac peaks identified in HEK293 cells. Since this motif 
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Figure 3 Distribution of H4K16ac in genes and its associated sequences. (A) Position of H4K16ac enriched fragments with respect to TSS 
(±10 kb of TSS; UCSC hg 18) on 10 representative genes from different chromosomes is shown based on ChlP-chip data. (B) Transcription factor 
binding sites with significant enrichment (p-value <0.0005) within H4K16ac ChlP-chip peaks. Analysis was performed using TRANSFAC. (C) Sequence 
logo of the motif identified in H4K16ac-enriched DNA fragments. 
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is similar to the CTCF insulator binding (CCCTC-binding) 
protein motif (antisense), co-occupancy analysis of 
H4K16ac with CTCF DNA binding proteins was performed 
which revealed that 13,230 unique peaks, corresponding to 
51% of H4K16ac peaks, contained at least one CTCF motif 
(data not shown). 

Given that transcription factor binding motifs overlap 
with H4K16ac sites localized near promoters, it is 
possible that H4K16ac directly regulates gene expression 
through effects on transcription factors. Therefore, we 
also analyzed the H4K16ac peak distributions within 
± 10 kb of gene TSS after dividing the genes into three 
categories based on HEK293 cell expression levels: top 
30%, middle 40%, and bottom 30%. The HEK293 cell 
expression profile data were obtained from published 
microarray data analysis [50]. Within ±10 kb of TSS for 
the top 30% and middle 40% of expressed genes, the 
H4K16ac peak distributions were clearly different from 
those of MOF, which displayed a mono modal distribution 
at the TSS in mouse ESCs as well as human CD4 + T cells 
[44,51]. The H4K16ac density in the vicinity of TSS had a 
V-shape distribution (Figure 4), with a minimum in peak 



intensity near the TSS that was lower in high and middle 
level expression genes than in the low expression group but 
then increased rapidly on either side of high and middle 
expressed genes. Overall, the data indicate that 
H4K16ac containing histone deposition has a complex 
correlation with overall gene expression level in 
HEK293 cells as proposed earlier [33]. 

The H4K16ac distribution profile in HEK293 cells is 
similar to that observed in Drosophila and yeast 
[8,52,53]. On the Drosophila X chromosome, MOF 
displays a bimodal distribution, binding to the promoter 
and 3' end regions of genes, whereas H4K16ac is prefer- 
entially located at the 3' ends. Thus it seems likely that 
MOF distribution is not identical to that of H4K16ac 
and this tendency may be evolutionally conserved. MOF 
forms MOF-NSL complexes that are enriched at TSSs 
while a dosage compensation complex (DCC), also 
known as the MOF-MSL complex, is enriched at 3 f gene 
regions. Although MOF is enzymatically active in both 
complexes, H4K16ac is low at TSSs suggesting that the 
MOF-NSL complex may be either less efficient or sub- 
ject to site or cell type specific inhibition. Alternatively, 




High (30%) 

(Expression > 2.0) 



Mid (40%) 

(2.0 > Expression > 1.2) 



Low (30%) 

(Expression < 1.20) 



10000 



Figure 4 Localization of H4K14ac peaks in genes. Untreated HEK293 gene expression profiling data were used to determine the expression 
levels of entire genes and divided into three categories: top 30% high expression genes, bottom 30% low expression genes, middle 40% genes 
between. In each category, H4K16ac peak distribution with respect to TSS was determined and displayed. 
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H4K16ac modified nucleosomes may be displaced up or 
down from TSS by alternative mechanisms involving 
transcription initiation or elongation. In support of a role 
for H4K16ac in transcription, a recent report demonstrated 
that the enhanced gene expression occurring during X 
chromosome dosage compensation in Drosophila is via 
H4K16ac-dependent enhancement of transcriptional elong- 
ation [53]. Perhaps in some HEK293 genes transcription 
levels are enhanced by the distribution of H4K16ac 
toward 3' ends of genes, possibly as MOF-MSL or similar 
complexes support transcriptional elongation. 

We next analyzed H4K16ac peaks within ±10 kb of 
TSS for genes whose expression was altered in MOF- 
depleted HEK293 cells using differential gene expression 
profiles obtained previously from control and MOF 
knockdown HEK293 cells [20,50]. Microarray profiling 
revealed that 1020 genes (up: 426 genes, down: 594 genes) 
were differentially expressed (fold change; >1.5) [50] which 
is smaller than the 4475 genes that changed more than 
2-fold in expression [44,48] in MOF conditional knockout 
ESCs. Fewer genes may be differentially affected by MOF- 
depletion in HEK293 cells due to cell type differences; ESCs 
are rapidly growing pluripotent cells whereas HEK293 cells 
are kidney cell-derived cancer cells simply undergoing con- 
stant self-renewal. Thus, transcriptional regulation by MOF 
may play a more significant role in ESCs than HEK293 
cells, even though MOF is essential in both situations for 
cell viability and proliferation [20,44]. Alternatively, the dif- 
ferent results may be due to the fact that siRNA-mediated 
MOF depletion is only partial (-85%), since total depletion 
in 293 cells is lethal as is complete Mof gene knockout dur- 
ing mouse embryogenesis [20]. Induced Mof gene knock- 
out in mouse ESCs by Cre expression eventually 
results in complete loss of Mof expression [44,48] and 
these cells therefore may have initiated changes related 
to cell death not observed during partial depletion. 

Finally, we determined the number of genes with 
altered expression following MOF depletion that also 
harbored H4K16ac peaks within ±10 kb of the TSS. 
Surprisingly, there were only 73 such genes, of which 26 
were up- and 47 down-regulated (Figure 5A), a result 
significantly less than the 1295 up-regulated and 1557 
down- regulated genes detected at Mof binding sites in 
mouse ESCs [44]. Ingenuity Pathway Analysis of the 73 
genes indicated a majority were involved in cell cycle, 
DNA replication, recombination, repair, or cancer related 
pathways. The Nanog transcription network, which is a 
critical direct MOF-dependent transcription group in 
ESCs, was not enriched in HEK293 cells. Within the 73 
gene subgroup, the H4K16ac distribution profiles for 
up- and down-regulated genes did not display any unique 
site differences that could account for differential ex- 
pression (Figure 5B and 5C, respectively), though 5-10 kb 
upstream of the TSS, up-regulated genes had a slight 



plateau in distribution that was lower overall compared to 
the more irregular distribution seen in down-regulated 
genes. These results suggest that H4K16ac distribution 
does not correlate highly with transcriptional regulation in 
HEK293 cells, where it may also play a smaller role than it 
does in the critical self-renewal of mouse ESCs [44]. In 
yeast, SAS-I-dependent H4K16 acetylation is reported to 
be independent of transcription and histone exchange 
[52]. It is also demonstrated in yeast that H4K16ac is 
enriched in inactive genes [26]. Although, H4K16ac has 
been linked to X chromosome transcriptional elongation 
in Drosophila [53], MOF activates only a defined subset of 
promoters [54]. In mammalian cells, phosphorylation of 
H3 SerlO during transcriptional activation leads to the re- 
cruitment of MOF, which in turn modulates transcription 
elongation via recruitment of BRD4 [2]. Indeed, MOF in 
Drosophila directly binds in a bimodal fashion to the 
promoters and 3'-end of dosage-compensated genes on 
the male X-chromosome and increases their expression 
[8]. Thus, MOF may have different roles in transcriptional 
regulation using different mechanisms that is organism 
and cell type specific. Indeed, MOF in Drosophila has 
been shown to have a critical role in DNA repair [55]. 

In summary, the present results imply that the role of 
H4K16ac in transcription regulation is cell type- 
dependent and possibly less important in differentiated 
cells where other epigenetic marks, such as DNA 
methylation, may override histone codes [56]. Interest- 
ingly, even though MOF dependent H4K16ac has only 
limited correlation with transcriptional regulation in 
HEK293 cells, it is still required for cell survival. Fur- 
thermore, only 13.44% of DNase I hypersensitive sites 
are localized within 1 kb of H4K16ac peaks in HEK293 
cells (Figure 5D), suggesting that the majority of 
H4K16ac distributes to regions other than DNase I 
hypersensitive sites and loss of H4K16 acetylation, 
therefore, is unlikely to affect the global distribution of 
DNase I hypersensitive sites. Interestingly, it has been 
reported that the function of MOF is largely dependent 
on the activities of tumor suppressors [57]. Therefore, 
the critical role of MOF in transcriptional regulation 
may be in combination with other transcriptional fac- 
tors, such as FOXP3 [57]. In other words, MOF 
dependent transcriptional regulation could be import- 
ant only when cells are exposed to certain stimuli. 
Overall, our results suggest that although MOF may 
have a critical function in transcriptional regulation of 
some genes in specific cell types, additional functions 
other than transcriptional regulation underlie the re- 
quirement for MOF/H4K16ac for cell survival. These crit- 
ical functions may be related to MOF-mediated acetylation 
of chromatin-associated proteins other than H4 or alter- 
ations of chromatin structure induced by such modifi- 
cations that are not detectable by DNasel sensitivity. 
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Figure 5 H4K16ac distribution in genes differentially regulated by MOF status. (A) H4K16ac peak distribution in differentially expressed 
genes. H4K14ac peaks are mapped within ±10 kb TSS with pink bars for each gene and the total H4K16ac distribution is shown in lower panel. 
Genes are aligned according to relative changes in expression of the transcript as determined by microarrays on targeted depletion of H4K16ac 
in HEK293 cells. (B) H4K16ac distribution in down-regulated genes (less than -1.5-fold). (C) H4K16ac distribution in up-regulated genes (more 
than 1.5-fold). (D) DNase I hypersensitive sites are poorly correlated with H4K16ac peaks in HEK293 cells. Total H4K16ac peaks were analyzed for 
DNase I hypersensitive sites. As shown in table, only 13.44% of H4K16ac peaks also contain DNase I hypersensitive sites within the region ±1 kb 
from the peak. 



Materials and methods 

Identification of H4K16ac peaks in HEK293 cells 

HEK293 cells were cultured as described previously 
[13,20,50]. ChIP was performed in triplicate as described 
[50] using anti-H4K16ac antibody (EMD Millipore, 
Billerica, MA). Recovered DNA fragments were purified 
and ChlP-chip was performed in Roche NimbleGen, Inc. 
(Madison, WI). DNA without ChIP processing was used 
as a reference. The differential gene expression profile was 
obtained from GEO (accession number: GSE20193) [50]. 

Raw ChlP-chip fluorescence intensity data obtained from 
NimbleGen Human ChlP-chip 2.1 M Whole-Genome 
Tiling -10 Array Set (probe length: 50-75 mer, median 
probe spacing 100 bp) were analyzed using Roche 
NimbleScan 2.6 software. Settings included probe intensity 
ratio calculation between experimental (ChIP) sample 
and control (non-specific IgG) sample. Enrichment is 
quantified by probe intensity log ratios calculated as the 
base 2 logarithm of the ratio between the experimental 
and control samples. Using the default parameters 



(probe length = 50 bp; P start = 90; P step = 1; # step = 76; 
min probes > cutoff in peak = 4; when all probes in 
peak > cutoff = 2 and width of sliding window = 500 bp) 
peaks were identified at FDR <0.005. All coordinates in 
this manuscript are reported in hgl8 (GEO accession 
number: GSE44254). 

Identification of an H4K16ac binding DNA motif 

To identify DNA sequence motifs in the H4K16ac-bound 
regions detected by ChlP-chip, the sequences of all 25,893 
H4K16ac-bound regions were retrieved from UCSC built 
hgl8 and inputted into the Gibbs sampler motif finding 
program provided by CisGenome. The motif finder was 
run searching for motifs of 12 bp using 5,000 MCMC 
iterations and a score was produced for each motif. 

Transcription factor binding site and DNase I 
hypersensitive site enrichment analysis 

For transcription factors enrichment analysis we first 
generated ten control sequences by randomly picking 
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sequences of same size from the same chromosome. 
Both the peak sequences and control sequences were 
given as input into the Match tool to scan for enriched 
PWMs listed in TRANSFAC professional The total 
occurrence of any given TFBS on each peak sequence 
was considered as the observed frequency. Similarly, the 
occurrence of a TFBS in control set sequences, gave the 
randomly expected frequency. The discrepancy between 
observed and expected frequency was evaluated by 
determining the statistical variable chi-square (x 2 )- 
The presence of CTCF motif within peak sequences 
(FDR < 0.005) was looked for the presence of 9-mer 
CTCF motif (AG[GA] [GT]GG[CAT] [GAT] [CG] and 
their complementary sequence). We found 13,230 unique 
peaks (51% of overall peaks) with at least one 9-mer CTCF 
motif. DNase I hypersensitive sites data for HEK293T cells 
(a cell line derived from HEK293) were used for our 
correlation analysis obtained form: http://genome.ucsc. 
edu/cgi-bin/hgTrackUi?hgsid=302749441&c=chr6&g=wg 
EncodeOpenChromDnase. 
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